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10 . 1 Introduction 

m . 

This paper is concerned with some aspects of the search for conservation laws for mechanical 
systems with nonholonomic constraints (i.e. velocity-dependent constraints). Such constraints 
show up naturally e.g. in robotics and in control theory when one considers rigid bodies rolling 
over a surface or over each other, or possessing a contact point with a surface in the form 
of a knife edge. Standard references to the differential geometric approach to systems with 
^ ■ nonholonomic constraints are the recent books (2j[7J[Il]- In this paper we follow a Lagrangian 

formulation of nonholonomic systems. The constraints will be assumed to be linear in velocities 
and both the Lagrangian and the constraints will be assumed to be independent of time. 

We shall investigate the relation between first integrals (or constants of the motion) on the 
one hand and symmetries on the other hand. In the absence of nonholonomic constraints, this 
relation is often described by what is commonly called 'Noether's (first) theorem'. Although 
anyone working in the field is familiar with this terminology (see [27J for a recent book on the 
history of the subject), a quick scan through the literature immediately reveals that the precise 
formulation of the theorem is somewhat subject to personal taste. In fact, the terminology 
'Noether's theorem' is associated with many slightly different manifestations of the same group 
of underlying ideas. For example, some authors relate Noether's theorem to invariance transfor- 
mations of the action functional (up to a gauge term) depending on position (and possibly time) 
only, while others also use the term Noether's theorem for generalizations to velocity-dependent 
transformations. At the infinitesimal level, the first point of view relates to the existence of a 
vector field on the configuration space whose complete or tangent lift to velocity space preserves 
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the Lagrangian, while the second viewpoint is related to the existence of a vector field on ve- 
locity space, not necessarily projectable to one on the configuration space, that, among other 
things, preserves the so-called Cartan 2-form. Throughout this paper, we shall refer only to the 
first viewpoint as the 'Noether theorem', while we shall call the second viewpoint the 'Cartan 
form approach'. For unconstrained Lagrangian systems, the relation between the two viewpoints 
was well established in e.g. [9], where it is argued that the Cartan form approach is superior 
since it embodies a simple direct correspondence between symmetries and first integrals, and in 
any case contains the Noether theorem (in the first sense) as a special case. Besides the two 
above-mentioned versions of the Noether theorem, one may find in the literature many other 
generalizations. For a review on those we refer to [32j . 

Given the lack of consensus for standard Lagrangian systems, it is no surprise that the situation 
is even more troublesome for the translation of the ideas behind the Noether theorem to the 
context of Lagrangian systems with additional nonholonomic constraints. It is well-known that 
for those systems the relation between symmetries and constants of the motion is no longer as 
natural as it is for the unconstrained case. Nevertheless, many people have investigated the 
conditions under which symmetry properties of the system do lead to conservation laws. These 
investigations have resulted mostly in generalizations of the two viewpoints described above for 
the unconstrained situation. For example, papers discussing the first point of view are those by 
Fasso and co-workers |19|. 1201 [2T| [22] and Iliev et al. [24\ I25j . Results concerning the Cartan 
form viewpoint may be found in e.g. |23j (for time-dependent systems) and [H [8j [15] (from 
the Hamiltonian point of view). Actually, the bulk of the literature focusses on a very special 
case: the one where the system is invariant under the tangent lift of an action of a Lie group 
(see e.g. [3j SJ EJ EJ EH [33]) (none of these lists is meant to be exhaustive). Unfortunately, for 
nonholonomic systems it is far from immediately obvious how the two distinct viewpoints are 
to be compared. The main goal of this paper is to come to a transparent description of these 
two levels of generalization and of their interaction. 

Throughout the paper we shall take advantage of a simple and well-known observation we have 
used also in previous publications [HI [12] : the dynamics can be represented by means of a vector 
field, and so can any symmetry (at an infinitesimal level). In Section 2 we recall the version of 
the d'Alembert principle we stated in [11] and we interpret it here in terms of the fibre metric 
given by the Hessian of the Lagrangian. In the following section we discuss some generalities 
concerning the restriction of the Cartan 2-form to the constraint submanifold, we present what 
we believe to be the most general relations between symmetries and constants of the motion in 
the Cartan form approach and we discuss the special case where the nonholonomic distribution is 
maximally non-integrable. In the fourth section we translate the nonholonomic Noether theorem 
of [21] to our formalism. We further show how it can be derived from the results in the previous 
section and we discuss some special cases. The last section contains applications of the previous 
results to Riemannian submanifolds, to Lagrangians of mechanical type, and to the search for 
linear and quadratic integrals. 

We shall assume that the reader is familiar with the basic tools and concepts needed for the 
geometric description of Lagrangian systems, such as the vertical and complete lifts X v and X° 
of a vector field X on Q, the vertical endomorphism S, the concept of a second-order ordinary 
differential equation vector field, etc. For definitions and basic properties we refer to e.g. [131117]. 
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2 The Lagrange-d'Alembert principle and the fibre metric 



Our starting point is the formulation of the equations determining the dynamics of a regular 
Lagrangian system subject to nonholonomic linear constraints which we gave in |11J . The 
constraints may be defined by either a distribution D on configuration space Q (the constraint 
distribution), or a submanifold C of TQ (the constraint submanifold) . The two are related as 
follows: C = {u £ TQ : u £ V q C T q Q,q = r(u)}; r stands here for the tangent bundle 
projection TQ — > Q. We assume that the dimension of each T> q is constant and equal to m. 
Throughout the paper we shall denote by i the injection C — > TQ. A vector field r on C is said 
to be of second- order type if it satisfies r* u r = u for all u € C. A Lagrangian function L on 
TQ is said to be regular with respect to T> if for any local basis of T>, 1 < a < m, the 

symmetric m x m matrix whose entries are X^(XJ(L)) (functions on C) is nonsingular. In [11] 
we proved the following proposition. 

Proposition 1. Let L be a Lagrangian on TQ which is regular with respect to T>. Then there is 
a unique vector field T on C which is of second- order type, is tangent to C, and is such that on C 

T{Z V {L)) -Z C (L) = 

for all Z £ T>. Moreover, V may be determined from the equations 

T(XZ(L))-X%(L) = 0, a = l,2,...m, 

on C, where {X a } is any local basis for T>. 

This is our version of the Lagrange-d'Alembert principle; the vector field T is the dynamical 
field of the constrained system. 

We shall make use of the following result. For any smooth map between manifolds <fi : M N, 
if £ € X(M) (the module of smooth vector fields on M) and n £ X(N) are ^-related and x is 
any form on N then C^((p*x) = <f>*(£>riX)- We shall apply it with 4> = l, when £ is tangent to C 
(so that the restriction of £ to C is i-related to £ itself), to obtain C^(i*x) = t*(C^x) (where on 
the left-hand side £ should be understood as £|c)- This holds, mutatis mutandis, when x is a 
function. In a similar vein, if £ is tangent to C then £j i*x = x)- 

We shall assume that the Lagrangian L, in addition to being regular with respect to 2?, is also 
regular with respect to TQ (i.e. that L is a regular Lagrangian in the standard sense). Remark 
that in case the Hessian of L is positive definite, L will automatically be regular with respect 
to both TQ and T>. The dynamical vector field V defined in the proposition above should not 
be confused with the standard Euler-Lagrange field Tq of L on TQ. The latter is uniquely 
determined by the condition that on TQ (i.e. not only on C) 

T (Z V (L)) - Z C (L) = 

for all vector fields on Q (i.e. not only those in D). We shall not make a notational distinction 
between Tq and its restriction to C. It is easy to see that, on C, T — Tq is vertical (with respect 
to the projection r). 

Let g be the Hessian of L with respect to fibre coordinates. It can be considered as a fibre metric 
on TQ, so that it defines a scalar product of vertical vectors, as follows. Let us equip each fibre of 
TQ, which is of course a vector space, with the flat connection, and denote its covariant derivative 
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operator by V . Then for any pair of vertical vector fields V and W, VyW makes sense, and is 
a vertical vector field. Moreover V a v W - V° W V = [V, W]. Set g(V, W) = V(W(L)) - V^W(L). 
Then g is bilinear over C°°(TQ) and symmetric. If we take V = X v , W = Y v for vector fields 
X, Y on Q we get back the usual definition of the Hessian, X V (Y V (L)) = g(X v ,Y V ), since 
VyY v = for any vertical V; but note that we have extended the ring of coefficients from 
C°°(Q) to C°°(TQ). 

The interesting point is that T — To is perpendicular (with respect to g) to X v for all X in T>. So 
in an obvious sense, T is the perpendicular projection of the unconstrained dynamics Tq onto C. 
The basic fact is evident: on C, (T -T )(X V (L)) = for all X in V, since T(X V (L)) = X C (L)) = 
T {X V (L)). In view of the definition of g it follows that g(T-T ,X v ) = for all IgP. In fact 
g(T — Tq, V) = for any vertical vector field V tangent to C; or if we restrict attention to any 
fibre T q Q, we see that (r — To)x q Q is normal, with respect to g\r q Q, to the submanifold (indeed, 
linear subspace) C q of T q Q. It will be convenient to say that a vector field V is fibre-normal to 
C if it is vertical and g(V, W) = for all vertical vector fields W which are tangent to C. Then 
r — To is fibre-normal to C. 

This discussion may be summarized in the form of the following proposition, which amounts to 
an alternative formulation of the Lagrange-d' Alembert principle of Proposition [TJ 

Proposition 2. Let L be a Lagrangian on TQ which is regular with respect to both V and TQ. 
Then the dynamical field T of the corresponding constrained system is the unique vector field 
tangent to C such that T — Tq is vertical and fibre-normal to C with respect to the fibre metric 
determined by L. 

Note that T is necessarily of second-order type since it differs from a second-order differential 
equation field by a vertical field. 

In what follows we shall always implicitly assume that L is regular with respect to both T> and 
TQ. 

The fact that V is the image of Tq under a projection operator appears in a paper of de Leon 
and Martin de Diego [16j, but the result is derived there in a way which somewhat obscures the 
role of the Hessian g. 

3 The Cartan form approach 

Let S be the vertical endomorphism on TQ. It can be defined by means of its action on vertical 
and complete lifts: S(X°) = X v and S(X V ) = 0. It is common to call the form 9l = dL o S, or 
more succinctly S(dL), the Cartan 1-form; the Cartan 2-form is ojl = dOj^. Some authors also use 
'Poincare-Cartan forms' or simply 'Poincare forms' for these forms. According to [28J Poincare 
was the first to introduce the forms, while Cartan defined their extension to the context of time- 
dependent Lagrangians. The Cartan 2-form can be thought of as the Kahler lift of the Hessian 
g. As such, ojl evaluated on a pair of vertical vectors gives zero, ljl(S(X),Y) = cjl(S(Y),X), 
and if V is vertical then ojl(V,Z) = g(V, S(Z)). The next subsection is concerned with the 
properties of the restriction of the form ujl to the constraint submanifold C, in both senses: ujl\c 
and 

At any point u G C C TQ, the tangent space T U C projects onto T q Q (where q = r(u)), and the 
kernel of the projection is the vertical lift of T> q . We denote the vertical lift of T> q to u by T>^, 
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and the vertical subspace of T U (TQ) by V U (TQ). The subspace of V U (TQ) consisting of vectors 
fibre-normal to V^, say (V^)- 1 -, is complementary to T U C in T U (TQ). An obvious move is to 
consider those vectors £ G T U C such that S{£) G T>^, and those such that S(^) G (X>^)^. Denote 
the former by T) u , the latter by T>^. Then T> u and T>^ are subspaces of T U C, which together 
span it; but they are not complementary, in fact T> u Pi T>^ = TX^. 

Let us consider the distribution V : u i— > V u in greater detail. It is clearly projectable to Q, and 
T|c*£> = V. Its kernel under projection is just V Y , that is, the vertical distribution on C. So the 
fact that t\c*X> = V defines V; we can write V u = T| Cjm -1 (D T ( u )). 

We can now use the distributions V and V 1 - to construct a convenient local basis for X(TQ), or 
in other words an anholonomic frame on TQ, adapted to the study of the restriction of ojl to C. 

3.1 Construction of a suitable anholonomic frame 

We can evidently find local vector fields X a G T> and X a G V 1 - such that 

• for each u, {T* u X a ,T* u X a } is a basis for T q Q; 

• {S(X a )} is a basis for V v ; 

• {S(X a )} is a basis for (P v ) x . 

We could start with a basis {X{\ = {X a ,X a } of vector fields on Q (an anholonomic frame on Q) 
with {A" a } a basis for P. Take for X a some projection of X% into TC along vertical vectors: then 
S(X a ) = X%. By suitable similar modifications of the X£ we can fulfill the other requirements. 
Note that the vector fields X, L cannot be assumed to be projectable to Q. 

Denote S(X a ) by y a and S(X a ) by y a . Then {X a ,X a ,y a } is a local basis for X(C), and 
{X a , X a , y a , y a } is a local basis for X(TQ). We have g(y a , 34) = 0, where g is the fibre metric. 
Let us set g{y a ,yp) = g a p and #(34,34)) = 5afc- If ^ is regular with respect to V, (g a p) is 
nonsingular. From the above relation between ool and g, we get 

^h{y a , Xp) = g af3 , uj L (y a , X b ) = g ab , LO L (y a , X a ) = LO L (y a , X a ) = 0. 

Set 0JL{X a ,Xp) = uj a p and so on. If we change X a to X a = X a — g al3 uj a /3ya then X a G 

s(x a ) = y a , but 

WL(Xa, Xa) = U aa — g^UJ a pga~j = 0. 

So without loss of generality we can assume that oj aa = 0. We are still free to modify X a 
similarly, by the addition of a linear combination of the 34- Since g aa = 0, this won't alter the 
value of uj aa . Let X a = X a — \g^ u^y^. Then 

U)L(X a , Xp) = ^>a/3 - \g l5 ^a59^ + \g l6 Upsg al = 0. 

So without loss of generality we can further assume that uj a p = 0. 

We shall assume from now on that these modifications have been made, and we shall drop the 
overbars. The constructions of the previous paragraphs may then be summarized as follows. 

Consider the following three distributions on C. 

• V v = (34), of dimension dimD = m, is the vertical lift of V to C; 
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• V = (X a ), of dimension m, projects onto V, satisfies S(V) = T> v , is a complement to T> v 
in V, and is isotropic with respect to ojl] 

• V T = (X a ), of dimension n — m, projects onto a complement to V in X(Q), is a complement 
to V in X(C), and is symplectically orthogonal to V; it satisfies S(V T ) = (D v ) ± . 

These three distributions are related to V, to X(C) and to X(TQ) as follows: 

f> = T> v V 
X(C) =V®V T 
X(TQ) = X{C) (P v ) x . 

Let {$ a ,'d a ,i£ a ,(p a } be the local basis of 1-forms on TQ, or coframe, which is dual to the basis 
{X a ,X a ,y a ,y a } of X(TQ). Then W = S^) (i = a,a). Also, {$ a , $ a , <p a }, when restricted to 
acting on X(C), is a local basis of 1-forms on C (or more accurately, {l*^, L*$ a , L*ip a } is a basis 
for 1-forms on C; however, we shall usually ignore this refinement, trusting that it will be clear 
from the context what is intended); and {(p a ) is X(C)°, the annihilator of X(C), so that i*(p a = 0. 
In terms of this coframe 

ul\c = 9a^ a A + g ab cp a A $ b + A 

Evidently for w^lc to be symplectic (that is, for L to be regular) it must be the case that {g a b) 
is nonsingular (this will automatically be so if the Hessian of L is positive definite). Assuming 
L is regular, at any ueC the symplectic orthogonal to T U C in T U (TQ) is given by 

(T U C) T = (X a + g bc u ab y c ) u . 
The characteristic subspace x^^l) of l*u>l is (T U C) T n T U C, and is therefore given by 

X (i*uj L ) = {?X a : {?u ab = 0} = X (wlI^t) • 

In summary, we have the following proposition. 

Proposition 3. We can find a frame {X a , X a ,y a ,y a } on TQ such that (y a ) = V v , (X a ,y a ) = 
V, (X a , X a , y a ) = X(C), (y a ) = (I' V )" L ; and such that with respect to the dual coframe i9 a , ip a , 
we have ((p a ) = X(C)° and 

= g a p<P a A ^ + \u ab r A $ b . 

(Strictly speaking there should be an i* on each 1-form on the right-hand side of the final 
formula.) We shall use this expression for l*ojl repeatedly below. Note that at u € C the 
first term on the right-hand side is the restriction of l*ujl to V u , the second its restriction to 
T>J; these are complementary subspaces of T U C and are symplectically orthogonal. Moreover, 
x{i*wl) C f)J; as is evident also from the expression above, l*lol is nonsingular on V u . 

Of course, we do not claim that the frame described in the proposition is unique. 
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3.2 First integrals and symmetries 



We are now ready to establish some relations between first integrals on the one hand and 
symmetries on the other. Some of the results in this section are closely related to analogous 
results that may be found in [23] (albeit in the context of time-dependent constraints) and [Hll5j 
(albeit in a Hamiltonian formalism). The reader should also keep in mind that by setting the 
constraint submanifold C equal to the whole tangent manifold TQ, one recovers the well-known 
relations between symmetries and constants of motion, to be found in e.g. 132] . 

Let us now work locally on C, in terms of the local bases of vector fields {X a , y a , X a }, and the 
dual basis of 1-forms (i.e. sections of T*C ->• C) ip a , i? a }. Note that {tf a } is a basis for T>°, 
the annihilator of T>. 

Given a 1-form tp on C, there need not be a vector field Z on C such that Zi l*ujl = ip'- it is 
necessary that ijj(W) = for all W G xi^^i)', and if such Z exists it won't be unique. However, 
we have the following important result, which establishes a modified construction for obtaining 
vector fields from 1-forms. 

Proposition 4. For any 1-form ip on C, there is a unique vector field Z on C such that Z £2) 
and Zji*u L -^e V° . 

Proof. The vector field Z given by 

z = g a ^(x^)y a -(y p ^)x a ) 

is well defined because of the assumed regularity of L. It belongs to T> and satisfies Za l*ul — i^ G 
D°. Now if Y G T> then Yj l*ujl G ip a ), by inspection of the expression for l*ojl\ so if Y G T> 
and Yj l*lol G T>° then Y = 0, and the vector field Z displayed above is uniquely determined. □ 

The penultimate statement is worth recording separately. 
Corollary 1. If Z G V and Za G V° then Z = 0. 

It is well-known (see |13| I17j) that Tq, the dynamical vector field of the unconstrained system, 
can be characterized as the unique second-order vector field satisfying ToJ ul + dE^ = 0. The 
function El = A(L) — L is the energy of the Lagrangian L and A is the Liouville field (the 
infintesimal generator of scaling transformations). It follows that the dynamical vector field 
of the constrained system, T, is determined by the 1-form —d{i*EL) in the way described in 
Proposition UJ Recall first of all that r G T> (it is tangent to C and at each u G C, t* u T = u G 
T^t{u))\ an d secondly that V — Tq G (V v )- l : say T = Tq + j a y a (the 7° are multipliers in some 
manifestation). Then on C 

Tj UJ L + dE L = (T - T )J UJ L = l^aA U L = l h 9ab® a - 

Now apply 1*, and the result follows. 

Theorem 1. For f a function on C, let Zj be the unique vector field on C such that Zf G T> 
and Zjj l*ujl — df G T>° . Then f is a first integral of T if and only if Zj(l*El) = 0. 
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Proof. Since T, Zf G V, 

r(/) = rj (Z/j l*lo l ) = -z f j (rj = z f (t m E L ), 

and the result follows. □ 



This procedure sets up a 1-1 correspondence between local first integrals of T (determined up 
to the addition of a constant) and vector fields Z G V such that Cz{^*^l) S d(V°) (i.e. such 
that £ z (/,*cj l ) = # for some (ft G X>°) and Z(l*E l ) = 0. Indeed, C z (t*u L ) = d(Zj l*uj l ) = d<j> 
for (ft G T>° if and only if Zj l*ujl differs from (ft G 2?° by a closed, and so locally exact, 1-form. 
If we set C z (l*0 l ) -cft = dF, then / above is given by / = F - l*B l (Z) = F- S{Z){L). 

Notice that l*El is a first integral, with corresponding vector field T, though the proposition is 
vacuous in this case; however, it is clear from the facts that Tj l*ojl + d(i*Ei) G T>° and r G V 
that T{l*E l ) = 0. 

We can now consider the question of whether there is any correlation between conserved quan- 
tities and symmetries for constrained systems. By an infinitesimal symmetry of V we mean a 
vector field Z tangent to C such that £z^ = 0. We shall give two different sets of conditions for 
deriving a symmetry from a first integral: the first result involves conditions on T, the second on 
the vector field Zf G T> corresponding to a first integral /. We derive them both as corollaries 
of the following proposition. 

Proposition 5. Let Z be a vector field tangent to C such that Dj Cz(l*col) C T>° and Cz(L)) C 
T>. Let f be any function on C and Zf G T> the corresponding vector field. Then Cz(Zf)j — 
d{Z{f)) G V°. If, further, Z(f) = then C Z {Z } ) = 0. 

Proof. We have Zfi l*ujl — df = (ft G T>°. Take the Lie derivative with respect to Z to obtain 

C z {Z f )j l*co l - d{Z{f)) = -Z f j C z (i*u L ) + £z4>- 

Since Zf G T>, by assumption the first term on the right-hand side belongs to T>° . Now for any 
Y G V, 

CzcftiY) = Z(<ft(Y)) - <ft(C z Y) = 

since C Z Y G V by assumption. Thus C Z (ft G V° . So C z (Z f )j - d(Z(f)) G V° . If Z(f) = 
(or indeed if Z(f) is constant) then Cz{Zf) G T> while Cz(Zf)j G V° , and so Cz(Zf) = 
by Corollary [TJ □ 

In the course of the proof we have in effect established that Czi'D) C V if and only if £^(2?°) C 
V°. 

The condition Vj Cz{i*(jJl) C V° is equivalent to Czi^^L)^,^) = 0, or in other words the 
distribution T> is isotropic for Cz{^*^l)] but in view of the appeal to Corollary [1] the formulation 
in the statement of the proposition seems preferable. 

Corollary 2. Suppose that T satisfies the conditions specified for Z in the proposition above. 
Then for any first integral f , Zf is a symmetry ofT. 

Proof. Since T(f) = 0, C Zf T = -C v {Z f ) = 0. □ 

Corollary 3. Let f be a first integral ofT, Zf the corresponding vector field. Suppose that Zf 
satisfies the conditions specified for Z in the proposition above. Then Zf is a symmetry ofT. 
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Proof. A certain amount of mental gymnastics is required here: we take Zf for Z in the proposi- 
tion above, and T for Zf (that is, we take —l*El for /). Since / is a first integral, Zf(i*Ei^) = 0, 
and the final conclusion of the proposition holds. □ 



From the second of these corollaries we obtain the following theorem, which gives sufficient 
conditions on a vector field Z G D for it to both be a symmetry and generate a first integral. 

Theorem 2. Let Z G £> be such that VjC z {l*uj l ) C t>° , C Z (P) C V, C z {l*uj l ) G d(T>°), and 
Z(l*El) = 0. Then Z is a symmetry ofT, and there is, at least locally, a function f on C such 
that Z = Zf and T(f) = 0. The set of vector fields Z satisfying these conditions forms a Lie 
algebra S. For Z\,Z 2 G S, with corresponding first integrals /i,/2, we have Zi(f 2 ) = —Z^fi), 
and the first integral corresponding to \Z\,Z 2 \ is (up to an additive constant) Zi(f 2 ). 

Proof. The vector field Z satisfies the conditions of Theorem [1] (see the remarks following it) 
and Corollary [3l If Z\,Z 2 G S, then Z 2 G T> and C Zl (T>) C V, so \Z\,Z 2 \ = £ Zl Z 2 G T>. It is 
easy to see that [Zi,Z 2 ] satisfies the other conditions, and also that k\Z\ + k 2 Z 2 , k\,k 2 G M, 
satisfies the conditions. Thus S is a Lie algebra. We have Z\{f 2 ) = 4*^(2^1,^2) = — ^(/i). 
From Proposition [5] we see that [Z x , Z 2 ]j l*oj l - d{Z 1 {f 2 )) G V°. □ 

3.3 Systems with maximally nonintegrable constraint distributions 

The statements in the previous sections can be further refined if one is in the situation where 
the distribution is 'as non-integrable as it can be' (in a sense we shall explain next). We shall 
show that one may actually assume without loss of generality that this is always the case. 

For any distribution T> on Q let [D] be the smallest involutive distribution containing T>; it 
consists of linear combinations of repeated brackets of vector fields in P, as the notation is 
designed to suggest. If [T>] = 3C(Q) we say that T> is maximally nonintegrable. Other authors 
use other adjectives, such as e.g. 'totally nonholonomic' [18] or 'completely nonholonomic' |31j . 

Let T>' be the distribution on TQ spanned by all vector fields X° and X v for X G T>. Then 
V' has the properties that t*P' = V and S{V) = V n V (where V is the vertical distribution), 
and it is determined by these properties. If T> is involutive so is P', since for any X, Y G T>, 
[X C ,Y C ] = [X,Y] C G V, [X C ,Y V ] = [X,Y} V G V, and of course [X v ,Y V ] = 0. Note that 
X(Q)' = X(TQ). 

Proposition 6. 

[v]' = [V]. 

Proof. Evidently V C [V]' and [V]' is involutive, so [V] C [V]'. On the other hand, [V] is 
spanned by the repeated brackets of complete and vertical lifts of vector fields in T>, and these 
(when nonzero) are complete or vertical lifts of vector fields in [V]: so [D]' C \D'\. □ 

Corollary 4. LfT> is maximally nonintegrable so is V . 

Now consider, for a constrained system with constraint distribution T>, those conserved quantities 
which are just functions on Q. Of course in the unconstrained case there aren't any; but in the 
constrained case the condition for / to be conserved is just that X{f) = for all X G T>. Then 
evidently X(f) = for X G [D] (see [20] for a similar statement). So these conserved quantities 
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are constant on the integral submanifolds of [T>]; and conversely, since T> C [T>]. So the leaves 
(maximal connected integral submanifolds) of [V] are the level sets of (an independent subset 
of) the conserved quantities /. 

Let us restrict everything to a leaf L of \D}. Note that T> is still a distribution on L; and since 
the base integral curves of T are everywhere tangent to T>, if they start in L they lie in L. 

Proposition 7. Let L be a leaf of [D], L, V the restrictions of L and T> to L, then (assuming 
that L is regular with respect to T>) the dynamical field of the constrained system on L defined 
by L and T> is just the restriction of T to TL n C . 

Proof. We can identify TL with the leaf of \V'\ in TQ which projects onto L, and the constraint 
submanifold C C TL corresponding to T> with TL n C. We know that T belongs to V , and 
is therefore tangent to C. Its restriction to C is uniquely determined by the restriction of the 
equations T(X V (L)) — X C (L) = 0, X G V, to C; but these are just the Lagrange-d'Alembert 
equations for the system on L. □ 

So without essential loss of generality we may assume that V is maximally nonintegrable: if not 
we just have to restrict to a leaf L of [V]. Suppose coordinates have been chosen on Q so that 
the leaves of [D] are given by x r = constant for an appropriate range of values of r: then these 
coordinates will of course appear in the expressions for base the integral curves of the restriction 
of r to L, but only as parameters which take the constant values appropriate to L, the leaf in 
which the curve lies. 

Let T> be as in the previous section. We also need to identify [D], the smallest involutive 
distribution on C containing T>. Now for any projectable vector fields Z\,Z 2 on C, [Z\,Z 2 ] is 
projectable, and t\c*[Zi, Z2} = [t^Zi, T\ Cif Z 2 }. Since [D] is spanned by repeated brackets of local 
basis vector fields of T>, it is projectable to Q, and its projection is an involutive distribution 
containing V. But by construction T|£*[2?] is spanned by repeated brackets of vector fields in V, 
so Tjc^fP] C [V], whence T|c, |t [T ) ] = \P\- Clearly the kernel of [V] under projection is V v . So [V] 
is determined by the fact that T| C ^[P] = [£>]; that is, [V] u = t^ c ^ u ~ 1 ([V] t ^). In particular, if V 
is maximally nonintegrable so is T>. 

Clearly if Z E V is such that Cz{l*ojl) = 0, so that Z is a symmetry of l*ujl, and dz^P) C V 
and Z(l*El) = 0, then Z satisfies the conditions of Theorem[2j When the constraint distribution 
is maximally nonintegrable we have the following partial converses. In the first we assume that 
T> is 2-step maximally nonintegrable. A distribution P on a manifold Q is 2-step maximally 
nonintegrable if V + [V, V] = X(Q). 

Proposition 8. // T> is 2-step maximally nonintegrable then a vector field Z € T> satisfying 
Va Cz{i*^l) € L)° , Cz{V) C T> and £z(i-*^l) € d(V°) is a symmetry of 

Proof. We have Cz(l*oj l ) = d<j> for some eft 6 T>° such that Yjdcj) £ V° for all Y G T>, or 
dcp{Yi,Y 2 ) = for all Y X ,Y 2 G f>. But for Y U Y 2 G T>, d(p{Yi,Y 2 ) = -(/)([Y 1 ,Y 2 }). If V is 2-step 
maximally nonintegrable, so is T>. So (ft vanishes on X(C), that is, Cz{i*ul) = 0. □ 

For the second result we need to strengthen the first condition of the theorem. 

Proposition 9. If T> is maximally nonintegrable then a vector field ZgP satisfying T>j jCz(^*^l) = 
0, Cz(T>) C V and Cz(l-*^l) £ d(V°) is a symmetry of 



10 



Proof. We have Cz{l*ojl) = # for some cj) € T>° such that Yj d<f> = for all Y G £>. We 
therefore consider the set S of those 1-forms ip € such that Yjdip = for all Y £ P, or 
equivalently £y0 = f° r au Y G £>. It is an R-linear subspace of T>°, and indeed a module 
over functions invariant under T>. Now if Y is any vector field on C such that ip(Y) = and 
Cyip = 0, and Y' = fY for any function / on C, then i/j(Y') = 0, and 

£ Y >ip = /(Yj cty) + d(fYj = fC Y ^ = 0. 

Moreover, if Y\, Y 2 satisfy ip(Yi) = ip(Y2) = and -Cy^ = ^y^^ = then 

il>m,Y 2 ]) = Y 1 (ip(Y 2 )) - £y^(Y 2 ) = 0, 

and 

^[Yi,y 2 ]^ = [C Yi ,CyM = 0. 

So if [T>] is the smallest involutive distribution containing T> then S C [P] - But if T> is maximally 
nonintegrable then [D] = X(C) and so S = {0}. Thus Cz{i*^l) = 0. □ 

4 The nonholonomic Noether theorem 

The title of this section refers to the terminology used in the paper [21] by Fasso et al. We shall 
first re-express their version of the theorem in the current framework. Next we shall show how 
it relates to the Cartan form approach. 

4.1 The theorem and the reaction-annihilator distribution 

In [11 j we defined (for any second-order field T) a 1-form e along the tangent bundle projection 
(restricted to C in the nonholonomic case) by e{X) = T(X V (L)) — X C (L) for X a vector field on 
Q. The Lagrange-d'Alembert principle is that there is a unique T of second-order type, tangent 
to C, such that e annihilates T>. The form e corresponding to that particular V is what Fasso et 
al., in [21], call the reaction set 1Z. The idea of Fasso et al. is that there may be vector fields 
Z on Q, not necessarily in V, such that e(Z) = 0; such a vector field belongs to the so-called 
reaction-annihilator distribution 1Z°. The next statement is the nonholonomic Noether theorem 
of [H EH [22]. 

Theorem 3. For a vector field Z on Q any two of the following three conditions imply the third: 
(1) Z C (L) = on C; (2) e{Z) = 0; (S) Z V (L)\ C is a first integral ofT. 

Proof. The proof is straightforward: we have T(Z V (L)) = Z°(L) + e(Z), so if any two of the 
terms vanish so does the third. □ 

We may equivalently express matters in terms of multipliers. Let {X a , X a } be a basis of vector 
fields on Q, where the X a span V. Evidently e(X a ) = on C, while e(X a ) = \ a , for some 
functions A a on C. These A a play the role of the Lagrangian multipliers one finds in many 
formulations of the equations of nonholonomic dynamics. Let Z = Z a X a + Z a X a ; then Z is 
in K° if and only if Z a X a = 0. By definition, Z a \ a = T(Z a X^(L)) - (Z a X a ) c (L). If now 
Z°(L) = 0, then (Z a X a )°(L) = —(Z a X a ) c (L). Replacing this above, and taking into account 
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the fact that T is such that (Z a X a ) c (L) = T(Z a X%(L)), we easily get that Z a X a = T(Z Y (L)), 
and the result follows. 

In this general situation, the conserved momentum Z V (L) may depend on the component of Z 
transverse to V. 

We could introduce the following small improvement to the above theorem: for any function / 
on Q, 

e(X) = T(X v (L)-f)-(X c (L)-f); 

if any two of the three terms vanish so does the third. If L is of mechanical type (meaning that 
it is of the form T — V, where T is associated to a Riemannian metric, and V is a potential) 
this adds nothing new, since if X C (L) = f then each side must be zero, by equating to zero the 
separate powers of u. Symmetries for which X°{L) = f are therefore only of interest for more 
general types of Lagrangians, e.g. for Lagrangians with magnetic terms. In that context, the 
Lagrangian is said to be quasi-invariant, see e.g. [30J. 

We shall now reinterpret the reaction-annihilator distribution 1Z° . It is not immediately clear 
which manifold 7Z° is supposed to be a distribution on. For each point (q, u) of C, there is a 
subspace of T q Q consisting of vectors v such that £( q , u )( v ) = ; but it will in general depend on u. 
For example, if the Lagrangian is of mechanical type with a potential term </>, then A a contains 
two terms, one quadratic in velocities and one independent of them: the first is what one gets 
from the kinetic energy term, the second is just X a {4>) (see the computations in Section 5.2). 
So the set {v G T q Q : v a X a (q,u) = 0} will depend on u, as we claimed above. 

One may impose further conditions so that 1Z° can be regarded as a distribution on Q. Indeed, 
there may very well be vector fields Z on Q such that Z a (q)X a (q,u) = for all u G C q (any 
ZgP will do). In the case of a Lagrangian of mechanical type, since the Z a are functions of 
q alone they must in fact satisfy two conditions: the one coming from the quadratic part of A a , 
and in addition Z a X a ((J)) = 0. It will become clear immediately below that to restrict attention 
to vector fields on Q in this way (i.e. to think of 1Z° as a distribution on Q) is to impose an 
unnecessary limitation, so we shall not insist on it. 

We can reinterpret e in terms of the fibre metric, much as we did in Section [2j In fact for any 
vector field X on Q, 

e(X) = T(X V (L)) - X C (L) = (T - T )(X V (L)) = g(T - T , X v ). 

From this perspective it is clear that if we require 1Z° to be in some sense a distribution, it must 
be interpreted as the distribution of vector fields along t which are fibre-normal to V — Tq. It 
will then consist of vertical vector fields on C C TQ, rather than vector fields on Q; but it may 
contain vertical lifts of vector fields on Q, and it is these which are of interest from the point of 
view of Theorem [3j 

4.2 Relation to the Cartan form approach 

We now discuss the nonholonomic Noether theorem of Fasso et al. from the Cartan form point 
of view. 

We first make one further interpretation of e. Notice that Tj ujl + dEi is a semi-basic 1-form 
along C, say e. For any v G T U (TQ) (where u G C), e u (v) = g a b(u)-d a (v)^ b = g u (S(v),T - T ), 
since S(v) = $ a (v)y a + ^"(u)^- That is to say, e u (v) = e u (t*v), or in other words e is e 
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considered as a semi-basic 1-form along C. The element of T>° determined by Tj l*ujl + cI(l*El) 
according to Proposition H] is i*e. 

Recall that for any vector field X on Q, the vector field X c satisfies C x cS = and [A, X°] = 0, 
where A is the Liouville field. 

For any Lagrangian system (with Cartan forms 6l and lol = dOi) 

X c joj l = X°a d6 L 

= c xC e L -d(x c j e L ) 

= C x c{S{dL)) -d(X v (L)) 
= S(d(X c (L)))-d(X^(L)). 

This holds everywhere on TQ, and regardless of whether L is regular. 
Secondly, 

e(X) = e(X c ) = (rj u L + dE L ){X c ) 

= -(X c auj l ){T)+X c {E l ) 

= -A(X C (L)) + T(X V (L)) + X C (A(L)) - X C (L) 
= F(X V (L)) -X C {L) 

as expected. This holds along C, but neither X c nor X v need be tangent to C. 
Thirdly, we propose an analogue of Theorem [3l 

Theorem 4. For any vector field Z tangent to C and for any function f onC such that Zj l*ujl — 
df € T>° , we have 

r(/) = Z{CE L ) - L*e(Z); 
and if any two of the terms vanish so does the third. 

Proof. This is a small generalization of the proof of Theorem [TJ □ 

(Note in passing that since L*e(Z f ) = (Tjl*u l + d{i*E L ))(Z f ) and Tjl*uj l + d(t*E L ) G V°, 
i*e{Z f ) = 0. Moreover, if Z^ l*uj l G V° then evidently i*e(Z) = Z(l*E l ).) 

We shall rederive Theorem [3] from the displayed formula in the statement of Theorem |4j that 
is, we shall show that Theorem [3] is a special case of our analogue theorem. Naively, one would 
like to substitute X° for Z: but this is not permissible since X° is not necessarily tangent to C, 
and in any case X° does not correspond directly to X v (L) via ujl- Let us denote by X° the 
projection of X° onto C along the y a , and let us set 

Z = -X c + g a ^(X c (L))y a . 

Then Z is tangent to C and satisfies Zjl*ujl — d(i*(X v {L))) G T>° . To see the latter, note 
that S(d(X c (L))) is semi-basic and S{d{X c (L))){X a ) = y a (X c (L)), whence S(d(X c (L))) - 
y a (X c (L))tf a G ($ a ). Recall the formula 

ujl = g a ^ a A & + g ab ip a Ai9 b + \u ab r A V b . 



13 



We have, on C, 

Zjtu L = -X c ju; L +y a (X c (L))i) a (modf) 

= -S(d(X c (L))) + d(X v (L)) + y a (X c (L))$ a (mod 
= d(X v (L)) (mod 0°), 

whence Zj l*oj l - d(i*(X v (L))) e £>°. Now since e is semi-basic, i*e(Z) = -e(X c ) = -e{X). It 
remains to calculate Z(l*El). For this purpose we require the following general result. Let V 
be any vertical vector field. Then from the general formula for the fibre metric g 

g(V, A) = V(A(L)) - V° V A(L) = V(A(L)) - V(L) = V(E L ), 

since V^A = V. Thus on C 

Z(E L ) = (-X c + g^y p (X c (L))y a )(E L ) 

= (-x c + ip a {x c )y a + g ^y p (x c (L))y a ){E L ) 

= —X c {Ei) + (p a (X c )g(y a , A) + g a ^ y p(X° (L))g{y a , A) . 

Since the constraints are linear, A is tangent to C, so g(y a , A) = 0. If we write A = v a y a then 
g(y a ,A) = g a pvP, and 

g^y^X c (L))g(y a ,A) = g^y p (X c {L))g a ^ 

= ^y p (X%L)) = A(X C (L)). 

It follows that on C, Z(E L ) = -X C {E L ) + A(X C (L)) = X C (L), and therefore Z(l*E l ) = 
l*{X c {L)). So the formula T(/) = Z{l*E l ) - L*e{Z) becomes T{l*(X v {L))) = l*X c {L) + e(X). 
This is for a particular choice of Z such that Zj l* uj l — d{i* (X Y (L))) G T>°. For any other choice, 
say Z', we have (Z - Z')j l*uj l £ V°, so that Z'(t*E L ) - i*e{Z') = Z(l*E l ) - i*e(Z), and the 
same conclusion holds. 

The drawback of the approach in Theorem HI however, is that the correspondence between first 
integrals and vector fields is no longer 1-1, as it was in Theorem [TJ 

4.3 Special cases 

A particular question of interest is whether, and under what conditions, a complete lift X° can 
satisfy the hypotheses of Theorem [2] on symmetries and first integrals. For this we require that 

1. X c £ V, 

2. Vj£ x c{l*lo l ) C f>°, 

3. C x c{V) C V, 

4. C x c(l*oj l ) £ d(V°), 

5. X c (l*E l ) = 0. 

A couple of points of notation. 
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• For a distribution D, we denote by T> its first derived distribution, which is the distribution 
spanned by V and brackets of vector fields in V, that is, V 1 = V + [D,V]. (Thus V is 
2-step maximally nonintegrable just when T> 1 = X{Q).) 

• We denote projectable vector fields on TQ, and more particularly on C, with overbars; 
thus Y is projectable, and we set t*Y = Y. Note that S(Y) = Y v . 

Lemma 1. A vector field X on Q is an infinitesimal symmetry of the distribution T> (that is, 
it satisfies C X (T>) CD) if and only if X° is tangent to C. 

Proof. Let (v a ,v a ) be the quasi-velocities corresponding to the frame {X a ,X a }, as in Then 
v a = on C. One easily verifies that X c (v a ) = if and only if [X, X a ] is of the form AaXp. □ 

Lemma 2. For a vector field X on Q, if X° G T> then C x c(T>) C T>. 

Proof. First of all, since X° is evidently tangent to C, Cx{D) C T>. We have to show that for any 
vector field Z G V, and for any u G C, (C x cZ) u is tangent to C and t*{C x cZ) u € T> T ^ u y It will 
be enough to consider those vector fields in T> which are projectable. Let Y be any projectable 
vector field in 2?, so that Y £ T>: then C x cY is tangent to C since both X° and Y are; it is 
projectable, and its projection C X Y belongs to V. □ 

So in this case condition 3 is superfluous. 

Proposition 10. For a vector field X on Q, X° satisfies the conditions 

1. X c G V, 

2. Vj£ x c(l*u} L ) C V°, 

3. C x c(l*oj l ) G d(D°), 

4. X c (l*E l ) = 0, 

if and only if 

• C X (V) C V, 

• X G V, 

• there is a (locally defined) function F on Q such that X C (L) = F on C, 

• for any Y G V 1 , Y V (X C (L)) = Y(F) on C. 

(Notice that if Y G V, so that Y v is tangent to C, Y V (X C (L)) = Y V (F) = Y(F) on C, so the 
final condition is automatically satisfied. So that condition is really concerned with derivatives of 
X C (L) in directions transverse to C; that is, it says something about how X C (L) changes as one 
moves off C. Moreover, the transverse directions involved are those that arise from bracketing 
vector fields in D.) 
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Proof. As a preliminary step we evaluate C x c{l*9l). Firstly, C x c6l = S(d(X°(L))), whence 
C x c9l(Y) = Y V (X C (L)) and C x c6l vanishes on vertical vector fields (both of these assertions 
holding everywhere on TQ). Thus £ x c(l*9l) vanishes on any vertical vector field tangent to C, 
while for any Y tangent to C, C x c{i*9 L ){Y) = l*{Y v {X c {L))). 

Now suppose that C X {V) C V, X G V, and on C, X C (L) = F and Y V (X C (L)) = Y(F) for any 
Y G D 1 . We show that the numbered conditions are satisfied. 

Since C X (V) C V, X c is tangent to C. Furthermore, t\ c *X c = X G V, so X c G V, which 
establishes that condition 1 is satisfied. For Y G V, so that Y v is tangent to C, we have 

£ x c(t*e L )(y) = Y v (l*(X c (L))) = Y V (F) = Y(F) 

(strictly speaking, t\q(Y(F))). It follows that C x c(l*9l) — dF G Z?° (since both terms vanish on 
vertical vector fields). Thus C x c{l*ujl) & d(T>°), and condition 3 is satisfied. We next consider 
condition 2. It can be written dC x c(L*0L)('D,'D) = 0. Using the usual formula for the exterior 
derivative it is easy to see that if either or both of the arguments is vertical then one gets zero. 
For Yi,Y 2 €V, 

d£ x c(L*9 L )(Yi,Y 2 ) = Yi(C x c(i*e L )(Y 2 ) - Y 2 (C x c(i*e L )(Y 1 ) 

= Y X (Y 2 (F)) - Y^F)) - C x c(i*0 L )QY u Y 2 ]) 
= [Y 1 ,Y 2 ](F)-S([Y 1 ,Y 2 nX c (L))) 
= 

because Y Y \X°{L)) = Y{F) on C for any Y G V 1 . Finally, condition 4 follows directly from the 
fact that A(F) = F. 

For the converse we shall make use of a frame {Xi} on Q with {X a } a local basis for V, as 
usual. We take vector fields Xi on (and tangent to) C projecting onto the Xf. they could be the 
fibre-orthogonal projections onto C of the Xf , for example. (These are not to be confused with 
the X a etc., which are not necessarily projectable.) Then S(X a ) = X%, and {X a ,X^} is a basis 
for V. We denote by v l the corresponding quasi-coodinates; v a = on C. 

Suppose that X° satisfies the numbered conditions. It follows from condition 1, firstly that X° 
is tangent to C and so C X (V) C V, and secondly that X = T\ Cif X° G V. Condition 3 implies 
the existence (locally) of a function F on C such that C x c{l*6l) — dF G T>° . Since £ x c(i*9l) 
vanishes on any vertical vector field V tangent to C, and all such vector fields belong to V, it 
follows that V(F) = for all such V, so F is (the pull-back of) a function on Q. Then 

X a (F) = £ x c(L*e L )(X a ) = X^(X C (L)), 

whence by condition 4 

X°(L) = A(X°(L)) = v a Xl{X c {L)) = v a X a (F) = F 

on C. From the calculation of dC x c{t* 9 l){Yi,Y 2 ) above we see that on C, for any Y G V 1 , 
Y Y (X C (L)) = Y(F). □ 

Corollary 5. The numbered conditions are satisfied if X is a horizontal quasi-symmetry of the 
system, that is, if C X (T>) C T>, X G T>, and for some function F on Q, X C (L) = F holds on 
TQ. In fact C x c(l*ujl) = in this case. 
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Proof. We have Y V (X C (L)) = Y(F) for any vector field Y on Q. It follows that £ x c(i*G L ) = 
dF. □ 



We may conclude, from Theorem [2j that if X is a horizontal quasi-symmetry then X c is a 
symmetry of l*ujl and of T, and there is, at least locally, a function / on C such that X° = Zf 
and r(/) = 0. This result, with f = F — l*(X v (L)) of course, is well-known; the point of the 
exercise was to see how it is related to the theorem. To turn things around, we may say that the 
vector fields Z satisfying the hypotheses of Theorem [2] should be regarded as generalizations of 
horizontal quasi-symmetries. 

5 Applications 

The bulk of the literature concentrates on the case where the Lagrangian is of mechanical type. 
So we now explain how the theory works in that special case. A simple subcase, but one which 
contains several instructive pointers to what happens in general, is furnished by submanifolds 
in Riemannian geometry. 

5.1 Conservation laws in Riemannian geometry 

First, some elementary remarks about geodesic conservation laws in Riemannian geometry. 

Let (M, g) be a Riemannian manifold, with Levi-Civita connection V. For any vector field Z 
on M, define a type (0,2) tensor Kz by Kz(u,v) = g(V u Z,v). The necessary and sufficient 
condition for Z to be a Killing field (infinitesimal isometry) is that Kz is skew-symmetric; in 
fact £ z g(u,v) = K z (u,v) + K z (v,u). 

For any curve c in M, and any vector field Z, 

^-(g{Z, c)) = g(V 6 Z, c) + g(Z, V 6 c) = K z (c, 6) + g(Z, V 6 c). 
at 

If c is an affinely parametrized geodesic, so that VcC = 0, and Kz(c,c) = 0, then g(Z,c) is 
constant along c. If Z is a Killing field then g(Z, c) is constant along every geodesic; and 
conversely (since there is a geodesic in every direction, so c is an arbitrary vector). This, in fact, 
is Noether's theorem in Riemannian geometry: there is a 1-1 correspondence between geodesic 
invariants of the form g(Z, c) and infinitesimal symmetries, that is, isometries or Killing fields. 

Now consider the case of a Riemannian submanifold N of a Riemannian manifold (M, g) (the 
metric on N is the restriction of g). The second fundamental form II is a type (0,2) tensor on 
N with values in the normal bundle, defined as follows. For any vector fields rj, ( on N, set 
n(f?, C) = ^r/C^ (t ne normal component of V^C)- For any function / on N we have V^/C = 
/VjyC + rj{f)Q, and so II(r/,/C) = fH(r],(), and II is tensorial. Moreover, V V C — V^?? = [rj, £] 
and the latter is tangent to N, which implies that II is symmetric in its arguments. 

A curve c on N is geodesic with respect to the induced metric if and only if VcC is normal to N 
(here V is the Levi-Civita connection of (M,g)). If c is geodesic then VcC = II(c, c). 

Now consider a vector field Z defined in a neighbourhood of N in M. Then for a geodesic c in 
N, 

±(g(Z, c)) = K z (c, c) + g(Z, V £ c) = K z (c, c) + g(Z, U(c, c)). 
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Of course if Z is tangent to N the last term vanishes and the situation reduces to the one 
discussed earlier. But suppose that Z is not tangent to N, but satisfies the following two 
conditions: the restriction of Kz to TN is skew, and Z is orthogonal to the second fundamental 
form of N. (Of course the codimension of N must be greater than 1 for this to be possible for 
Z not tangent to N.) Then g(Z, c) is constant along every geodesic of N. In fact any two of the 
following conditions implies the third: 

1. Z is orthogonal to the second fundamental form of N; 

2. the restriction of Kz to TN is skew; 

3. g(Z, c) is constant along every geodesic of N. 

The first and last of these depend only on the values of Z on N. Recall that Kz(u,v) = 
g(V u Z,v), so that Kz involves derivatives of Z: but since we are interested only in the case 
where u and v are tangent to N, we have to take derivatives only in directions tangent to N, 
so that Kz(u,v) also depends only on the values of Z on N. In other words, one could state 
the result as follows: let Z be a vector field on N (but not necessarily tangent to it — that 
is, strictly speaking Z is a vector field along the injection of ./V into M); then any two of the 
conditions above imply the third. 

Although Z need not be tangent to N, the conserved quantity g(Z, c) depends only on its 
component tangent to N. 

This result is of course the nonholonomic Noether theorem of [21] . for a kinetic energy La- 
grangian, in the case where the constraints are actually holonomic! Let us consider the result 
in this light. The equivalence between the condition Z C (L) = on C and item 2 above is dealt 
with below. The function Z V (L) on C is more-ordess g(Z,c) (the latter is the former restricted 
to a base integral curve of E); note that since T is tangent to C, in computing T(Z V (L)) we can 
restrict Z V {L) to C, i.e. we can set v a = before acting with E. (The conserved quantity depends 
only on the component of Z along the constraint distribution, just as we pointed out above for 
the submanifold case.) The new ingredient is the identification of the condition e(Z) = in 
terms of the second fundamental form. In fact the definition of the second fundamental form 
given above extends in a fairly obvious way to a distribution T> (assuming of course one has a 
metric): for any vector fields X, Y in T> set 

n(X,Y) = ±(V x Y + V Y X) ± 

where Z 1 - is the component of Z perpendicular to T>. Note that symmetry is no longer automatic 
- since we are now dealing with a nonholonomic distribution we won't have [rj, £] E V. The 
condition e(Z) = is just Z ± EL 

5.2 Lagrangians of mechanical type: the general case 

We shall derive expressions for e for a Lagrangian of mechanical type, 

L(q,u) = \g q {u,u) - 4>{q). 

The fibre metric essentially coincides with the metric g on Q, at least so far as vertical lifts are 
concerned. 
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First of all, X Y (L) = g(X,u), while X C (L) = g(V u X,u) - X ((/>). Let us take a frame {X a ,X a } 
in which the X a span T> and the X a are normal to T>, so that g aa = 0. As before we denote the 
corresponding quasi-velocities by (v a ,v a ), and we denote by W a gX^ the component of [X a ,Xp] 
in V. We write T = v a X^ + T a X^. Recall that T is tangent to C, and that v a = on C, so that 
u = v a X a on C, and (for example) g(X a ,u) = g a pv^ . Moreover, v^X^v 1 ) = —R^v^v = on 
C (by skew-symmetry). It follows that 

e(X a ) = (v?X% + rPx))(g(X a ,u)) - g(V u X a ,u) + X a (cj>) 

= g a ^ + A>( 5ct >V - 3 (V^I a ,I 7 )/^ + X a (<f>) 
= g a ^ +g(X a ,V Xp X 7 yv"< + X a (0), 

from which we can determine V. The calculation of e(A A a ) is much simplified by the choice of 
X a , since g(X a ,u) = g(X a ,v a X a ) = on C: thus 

e(X a ) = -g(V Xa X a ,X p )v a v p + X a (<j ) ) = g(X a , V Xa Xp)v a v? + X a (<f>). 

In the first term in the final expression, only the component of \7 x a Xp normal to T> matters. 
Let us write II Q( g = n^ a A 7 a for the symmetric part of the normal component of V x a Xp (i.e. the 
generalized second fundamental form). Then 

e(X a ) =g ab Il b a[S v a vP + X a (ct>), 

or more generally, for any vector field Y on Q normal to T>, 

e{Y) =g(Y,U(u,u)) + Y^). 

Since e annihilates T> and II is normal to it, we conclude that a vector field Z on Q satisfies 
e(Z) = if and only if g(Z,H aj 3) = and Z^{(\>) = where Z^ is the component of Z normal 
to T>. These are the conditions we alluded to in the discussion on the reaction-annihilator 
distribution of Section 4.1. 

If g is actually a flat metric (as is the case in many examples) then the second fundamental form 
condition is vacuous. 

The conditions for Z to generate a conserved momentum are that 

g(Z,n a p) = 0, Z ± ( ( f>) = 0, 
g{V Xa Z,X /s )+g(V Xf) Z,X a ) = 0, Z(<f>) = 0. 

Regarding the potential note that in fact it isn't enough that Z(cj)) = 0: in effect, both the 
component of Z along T> and the component normal to T> separately have to annihilate 4>. The 
condition g(Vx a Z, Xp)+g{V x^ Z, X a ) = just says that Kz, restricted to T>, is skew-symmetric. 
This result may also be found in e.g. [25J. 

Finally, we make the link to the Riemannian case, as described in Section 15.11 The Euler- 
Lagrange field To is the geodesic field. Consider a curve c in Q, and its natural lift C = (c, c) 
to TQ. It is easy to see (by a coordinate calculation for example) that the tangent field to the 
natural lift can be expressed as follows: 

c = r b + (v,c) v . 

For a constrained system of this type, therefore, c will be a base integral curve of the constrained 
dynamical field V if and only if VcC is normal to T>\ c . This includes the case in which T> is 
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integrable, giving the rule for geodesies in a submanifold. In fact for a base integral curve c of 
r we have (r - r )c = (V £ c) v = U(c, c) v . So the formula 

j t (g(Z,c)) = K z (c,c) + g(Z,U(c,c)) 

(see above) is the formula 

T(Z v (L)) = Z c (L)+g(T-T ,Z v ) 
for this case (along the natural lift C of c). 



5.3 Quadratic first integrals for constrained systems of mechanical type 

In the previous example we have used the conditions of the nonholonomic Noether theorem as 
discussed in Section [H In this section we give an example where we use the more general results 
of the section on the Cartan form approach. More in particular, we shall apply the conditions 
of Theorem [TJ 

First, a remark about connections. Let E — > M be a vector bundle with linear connection, 
with covariant derivative operator V and horizontal lift X i— > X H . Any section a of E — > M 
determines a vertical vector field on E by the identification of a point of a vector space with a 
(constant) vector field on it, ie by a variant of the vertical lift construction: call it therefore <r v . 
Then [X H ,a v ] = (V x <t) v . 

Now let F — > M be a vector sub-bundle of E, and suppose that E is equipped with a constant 
fibre metric g. For any v £ F, let H v be the fibre-orthogonal projection of the horizontal 
subspace H v of T V E into T V F, and X i— > X H the corresponding horizontal lift to F. Then for 
any section r of F — > M, [X h ,t v ] is a vertical lift, and if we set LY h ,t v ] = (Vxr) v then V is 
the covariant derivative operator of a linear connection on F. 

For x € M, let {ei(x)} be a basis for E x such that {e a (x)} is a basis for F x and {e a (x)} a basis 
for its orthogonal complement. Suppose that we have, locally, such a choice of bases depending 
smoothly on x; in other words, we have local sections ej with these properties. Let {X r } be a 
local basis of vector fields on M, and set V x r ^i = ^ri e fi ^ri are ^ e connection coefficients 
of V with respect to the chosen bases. Let (u 1 ) be the fibre coordinates on E corresponding to 
the basis of sections {e^}, so that e^{u 3 ) = 5j ; note that F is the submanifold u a = 0. Then 

lX?,eJ](un = X?(5l)-eJ(X?(uJ)) 
= (V* P e,) V) = C 

so that 

X r V) = -L>*, 

as one would expect. Now Xf — Xf 6 (e a ), say X^ 1 — X" = £r e a- Thus on F (where u a = 0) 
= — r r ^u Q , using the fact that X,?(it a ) = since Xf is tangent to the submanifold F. It 
follows that Xf = Xf + T r a a u a e v a . So finally 

\x n p v i = [y H P v i -i- fr a i/' 3 p v p v i = r* p v - r a p v = r^ 3 p v - 
that is to say, the connection coefficients for V are Fra. 
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Now let us specialize to the case in which E = TQ and F = C, for a constrained system of 
mechanical type. Choose the basis of vector fields {X a ,X a } with {X a } a basis for V and X a 
orthogonal to T>; then X^ corresponds to ej above. We take for V the Levi-Civita connection 
of the kinetic energy metric; we have an induced connection V on sections of C, i.e. on vector 
fields in T>, with V^Xa = T^ a Xn. This is all very much like the definition of the connection 
induced on a submanifold in Riemannian geometry (the connection V may in fact be found in 
e.g. |29|): note that indeed the generalized second fundamental form is given by 

Ha/3 = 2 ftaP + ^/3a) • 

Moreover, 

r = v a X° - (l»V + (grad^)^. 

We have written gradt/> for the vector field obtained by raising the index on dtp: that is to 
say, g(X, grad</>) = X(<f)). The corresponding term in T is (the vertical lift of) the orthogonal 
projection of grad</> onto T>. With our present choice of basis this is g a @ Xp((j))X a , where (g a/3 ) 
is the inverse of (gap), and also the corresponding block in the inverse of (gij). Likewise, the 
fact that we have Yp in the first term in the brackets is due to the choice of basis with X a 
orthogonal to T>, and therefore X^ fibre-normal to C. Now on C (where v a = 0) 

To = v a X c a - (r>V + (grad0rK v , 

so that 

r - r = (n^V + (grader) x a v . 

The vector fields {X*, I a H , X%, X%} form a local basis for X(TQ), of course: denote by {8 a , 6 a , cp a ,(j) a } 
the dual basis of 1-forms. We have 

the vector fields {Xf , X%} form a basis for X(C), with {X™, X^} a basis for V and S(Xf) = X%. 
Notice that since (XZ,0 a ) = (X^,6 b ) = (XX, ^ = 0, {l* 6 a , l* 6 a , l* cf) a } is the dual basis of 1- 
forms on C, and {i*6 a } is a basis for T>° . We don't claim that these bases correspond exactly to 
the ones used in Section 3, but nevertheless we can use them to analyse in the same way 
as we did there. 

We have 

UL = g a ?r *o $ + g ab r /\0 b . 

Using the expression for u)l given above we obtain, on C, 

X*j uj l 
Xfj u L 
X^ J uj l 

Then for any vector Z = + £ a X% + rfX^ tangent to C we find that 

Zj uj l (XX) = g af3 rf - g ab r^v^ b 

Zju L (xf) = g ab r a b ^c + (g ac r b c a - g bc r a c a )v a e 

Zju L (XX) = g a ^. 



= -g a ?r +ga b v^e b 
= -g ab ^ b + g bc r a b a v a e c 
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Thus the characteristic vector fields of ujl take the form Z = £, a X^ + r] a X^ where r/ a = 
g a ^gabFpyV' r £ b , and £ b satisfies 

(ga C r b c a -g bc r a c a )e = o- 

Now the connection coefficients are those for the Levi-Civita connection, but with respect to an 
anholonomic frame such that g aa = 0. It follows that gbc^aa + 9ai3^ab = 0- Thus the condition 
on £ a may be written 

- rf 6 )£ 6 = o. 

But since the Levi-Civita connection has no torsion, [X a ,Xb] = (T* b — T b l a )Xi. So the condition 
on £ b amounts to [£ a X a ,Xb] G £>-*-, or more generally J«, D 1 ] 6 V^. That is, £ Q X a (a vector 
field in T>^-) has the property that its bracket with every vector field in T>^ belongs to T>^ . 
This condition determines a subdistribution of T>^~, which is easily seen to be integrable (by the 
Jacobi identity). It's obviously a fundamental feature of D 1 - (or indeed of any distribution). It is 
perfectly possible, if the dimensions are right, for this distribution to reduce to the zero vector, 
in which case is symplectic: this should be an interesting class of constrained systems. The 
so-called nonholonomic particle in 3 dimensions is an example. 

Now C is a vector bundle, and so has associated with it tensor bundles of all types, which we 
shall call C -tensor bundles. Let A be a symmetric type (0, 2) C-tensor. With respect to the local 
basis of sections of C, A has components A a ^\ we can think of A as the corresponding 

collection of functions on Q, with the transformation rule induced by that of a change of basis 
for T>. Then A a pv a v^ is a well-defined function on C. Let / be a function on Q. We seek the 
conditions for ip = A a pv a v^ + / to be a conserved quantity for V. The vector field € T> from 
Theorem [1] determined by ip is easily seen to be given by 

g^(X^)Xj-X^)Xf), 

so ip is conserved if and only if 

g^{X^)Xj(E L ) - X^)Xf(E L )) = 0. 

In the computation it helps to note that we can replace X^ by X% since there are no occurrences 
of v a ; Xa(vP) = — L^yf 7 on C; and X^(T) = where T is the kinetic energy. We obtain 

(X a (A M ) - A Sl T*p - i^r*)//^ + (X a (f) - 2A^(grad<A)> a = °- 

So the necessary and sufficient conditions for ip = Ap^v^v" 1 + / to be a constant are that 

V {a A M) = 0, X a {f) = 2A a/3 (grad(^. 

(We have used brackets around indices to denote symmetrization, so that for example Tuj^\ are 
the components of the symmetrized tensor derived from a tensor T, ie ^ J] ^ijk where the sum 
is over all permutations of i, j and k. In view of the assumption that A a p is symmetric in its 
indices, V ( a Ap^ is in this case a constant multiple of the cyclic sum.) 

This is a direct analogue of the corresponding result for the unconstrained system (see [10J). 
That this is so is due to the fact that we are working with an anholonomic frame, to the choice 
of frame, and to the use of the induced connection V. 

The next question is what happens if A is actually the restriction of a tensor on Ql The answer 
is easier to state if we assume that the tensor on Q is also symmetric, but this is not essential — 
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all we require is that the restriction to T> is symmetric. On the other hand, there is no significant 
loss of generality in working with symmetric tensors. So let A be a symmetric type (0, 2) tensor 
on Q. For the restriction to C of the function AijV l v 3 + / to be a conserved quantity for T 
the components A a g of A must satisfy the conditions above. Since A is a tensor on Q we can 



We are not forced to take A a ( a Hp \ = 0, though it is obviously convenient if it holds. The 



Finally suppose that AijV l v 3 + f is a conserved quantity for the unconstrained system. Under 
what conditions is it also a conserved quantity for the constrained system? From the assumption 
that it is conserved for the unconstrained system we have 



From the first, in particular V( a ^4g 7 ) = 0, so we require in addition that A a ^ a H^ = 0. From the 
second, X a (f) = 2A a p(grad (ft) 13 + 2A aa (gr&d 4>) a , so we require in addition that A aa (grad 4>) a = 
0. 

These results include and extend those of Iliev and Semerdzhiev [241 [25] on quadratic integrals. 
It is easy to extend them also to constants of higher degree in v; however, this is the most 
interesting case because of the interaction between / and <j). For higher degree constants of the 
form A a /3.„$v a vP ■ ■ ■ v s + f the conditions corresponding to the first set are 
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